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Abstract 

Recent experimental and theoretical studies have shown that small asexual populations evolving on 
complex fitness landscapes may achieve a higher fitness than large ones due to the increased 
heterogeneity of adaptive trajectories. Here we introduce a class of haploid three-locus fitness 
landscapes that allow the investigation of this scenario in a precise and quantitative way. Our main 
result derived analytically shows how the probability of choosing the path of the largest initial fitness 
increase grows with the population size. This makes large populations more likely to get trapped at 
local fitness peaks and implies an advantage of small populations at intermediate time scales. The 
range of population sizes where this effect is operative coincides with the onset of clonal interference. 
Additional studies using ensembles of random fitness landscapes show that the results achieved for a 
particular choice of three-locus landscape parameters are robust and also persist as the number of 
loci increases. Our study indicates that an advantage for small populations is likely whenever the 
fitness landscape contains local maxima. The advantage appears at intermediate time scales, which 
are long enough for trapping at local fitness maxima to have occurred but too short for peak escape 
by the creation of multiple mutants. 

KEY WORDS: clonal interference, finite population, fitness landscape, fixation probability, 
three-locus models 
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Consider a population experiencing a recent environmental change. Assuming that the population is 
ill-adapted t o the new environment, as is typically the case in the beginning of an evolution 



experiment (jLenski and Travisano 



19941 ). the adaptation to the new environment relies on the supply 
of beneficial mutations available to the population. During the early stages of adaptation, it is a 
good approximation to assume that the supply of beneficial mutations is unlimited. Then as the rate 
at which the beneficial mutations appear is given by the mutation rate per individual times the 
population size, a large population is expected to experience more beneficial mutations and hence 
adapt faster than a small population. 

This conclusion does not seem to depend on the topography and epistatic interactions in the 
fitness landscape. If the fitness landscape is non-epistatic in the sense that (beneficial) mutations act 



multiplicatively on fitness, a large population adapts faster than a sr nall population, alt 



advantage is strongly reduced in asexuals due to clona. 



de Visser et al. 



19991 : 



Wilke 



2004 



Park and Krug 



2007 



Park et al 



rough this 



interference (IGerrish and Lenski 



1998 



20m . Even if the fitness 



lands cape is highly epistatic such as a maximal 



y rugged (house- of- c ards) landscape (jKingman 



20081 ) . This leaves the question as to 



19781 ). a large population still wins on average (jPark and Krud l! 
whether small populations might be at an advantage in complex fitness landscapes with an 
intermediate degree of epistasis and r uggedness . 



In a recent experimental work (iRozen et al 



20081 ) studying the adaptation dynamics of 



populations of E. coli in simple and complex nutrient environments, it was found that small 
populations could attain higher fitness than large populations in a complex medium which can be 
expected, on general grounds, to give rise to a rugged and epistatic fitness landscape. The observed 
fitness advantage of small populations was associated with a greater heterogeneity in their adaptive 
trajectories compared to large populations. Specifically, small populations that eventually reached 
the highest fitness levels were often the ones that initially displayed a rather shallow fitness increase, 
whereas the fitness of those that gained a large initial advantage tended to level off quickly. In large 
populations trajectories were more uniform and typically showed a rapid initial fitness increase 
followed by a significant slowing down or saturation. 

These experiments suggest that an adaptive advantage can arise from the higher level of 
stochasticity in the incorporation of beneficial mutations displayed by small populations, provided 
the topography of the underlying fitness landscape is sufficiently complex. As the detailed structure 
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of the experimental fitness landscape is unknown and unfeasible to determine, it is useful to 
investigate this mechanism theoretically. In previous work this w as done using exteiisive s imulations 



20091 ). The 



for a class of random fitness landscapes with tunable ruggedness (jHandel and Rozenl l! 
main conclusion from this study was that an advantage of small populations can be observed in a 
substantial fraction of random landscapes, but the dependence of the effect on parameters such as 
the population size, the mutation rate and the fitness effect of beneficial mutations was not explored 
systematically. 

In this article, we introduce a minimal, analytically tractable model wh ich captures the dynamic 

Rozen et al.l ((20081). We show that the 



behavior of the population fitness in the experiments by 
simplest fitness landscape that can exhibit a small population advantage is a haploid, diallelic 
three-locus landscape where the genotypes of minimal and maximal fitness are separated by three 
mutational steps. There are then 3! = 6 distinct shortest paths leading from the global fitness 
minimum (the wild type) to the global maximum, corresponding to th e different orderings in which 



the mutations are introduced into the population (jGokhale et al 



20091 ). We distinguish between 



smooth paths along which fitness increases monotonically, and rugged paths containing at least one 
deleterious mutation. The three-locus landscape is constructed in such a way that the rugged paths 
contain a local fitness maximum, and they confer the greatest initial fitness increase to a population 
initially fixed for the minimal fitness genotype. 

The existence of rugged paths is the hallmark of sign epistasis, a specific type of genetic 
interaction u nder which a given mutation can be benefi cial or deleterious depending on the genetic 



background (jWeinreich et al 



20051 : 



Poelwijk et al 



20071 ). Sign epistasis implies that at least some of 



the mutational pathways leading to the global maximum of the fitness landscape are rugged, and 
thus inaccessible to an adaptive dynamics that is constrained to increase fitness in each step. In 
addition, sign e pistasis is a necessary (but not sufficient) condition for the existence of multiple 



fitness maxima (I Poelwijk et al 



2011 



. The presence of sign epistasis was established in several recent 
experimental studies, where all combinations of a selected set of individually beneficial or deleterious 
mutations were constructed and their fitness effects (or some proxy thereof) were measured 



(Weinreich et al 



20061 : 



Lozovskv et al 



20091 : 



de Visser et al 



20091 : 



Carneiro and Hart! 



l3). 



20101). There 



is also considerable evidence for the existence of mu 



using bacterial and viral populations (jKorona et al 



tiple fitness maxima from evolutio n experiments 



1994 



Burch and Chao 



199S 



20001 : 
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Elena and Lenski 



20031 ) . It is therefore reasonable to assume that s ign epistasis a nd ni ulitple fitness 



Rozen et al 



(120081 ) 



maxima are present also in the complex environment considered by 

The analysis presented below shows that the speed of adaptation is generally an increasing 
function of population size both along smooth and along rugged paths. However, the probability with 
which a particular type of path is chosen depends on population size in such a way that small 
populations can be favored at least over a certain range of time scales. In particular, as we shall 
show, the probability to choose the rugged path in the three-locus model rises sharply with the onset 
of clonal interference, and it approaches unity when the dynamics becomes completely deterministic 
for very large populations, because t hen the mutation with the largest initial fitness increase is 



certain to fix (jjain and Krug 



l2007bl ). The dynamics of small populations are less predictable, and 
they therefore enjoy an advantage by more frequently avoiding getting trapped at the local fitness 
maximum. The main part of the article is devoted to a detailed, quantitative analysis of this 
scenario. We then show that the mechanism identified within the specific three-locus model is robust 
by simulating populations on variants of the house-of-cards fitness landscape with three or more loci. 



and conclude the paper with a discussion of our key results. 



Models 

FITNESS LANDSCAPES 

In the main part of this work we consider the space of genotypes composed of three loci with two 
alleles each, which will be denoted and 1. Each genotype is assigned a fitness W according to 

VF(OOO) = 1 

VF(OOl) = VF(OIO) = 1 + si 

VF(IOO) = 1 + S2 

VF(Oll) = W{Wl) = W{lW) = {l + sif 

w{ni) = (i + sif 

where S2 > si > and (1 + si)^ < 1 + S2 < (1 + si)^. Thus there is a local fitness maximum at 
genotype {100} and the global maximum is located at genotype {111}. 
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In addition to the landscape equation ([T]), we consider two ensembles of random fitness 
landscapes consisting of L-locus genotypes with two alleles at each site. In the first ensemble referred 
to as unconstrained ensemble, the least fit genotype is assigned the allele at every locus and fitness 
1 while the rest of the genotypes are given fitnesses 



(genotype) = 1 + Sx, 

where S controls the strength of selection and x is a random number drawn from an exponential 
distribution with mean 1. This is Kingman's house-of-cards model adapted to a finite number of 



diallelic loci (jKingman 



19781 : 



Kauffman and Levin 



19871 : 



Jain and Krug 



2007bl : 



Park and Krug 



20081 ). 



It is also instructive to s tudy a constrained version of the above model in which the fittest genotype 



has all loci with allele 1 (Klozer 



20081 : 



Carneiro and Hart] 



2010|). Such landscapes are generated by 



assigning the maximum value amongst 2^ — 1 random numbers given by equation ([2]) as the fitness 
of the antipodal point of the least fit genotype in the genotype space. Note that these landscapes 
have exactly the same statistics as those which are sampled under the condition that the antipodal 
genotype is the fittest among all randomly generated landscapes by equation ([2]) . 



POPULATION DYNAMICS 

We mainly work with a finite population of size which evolves according to standard 
Wright-Fisher dynamics in discrete generations. In each generation, an offspring chooses a parent 
with a probability proportional to the parent's fitness and copies the parent's genotype. Then the 
point mutation process is implemented symmetrically in which o 1 with probability fi. This 
process is repeated until all N offspring have been generated. In the actual simulations, we treated 



the population size of each genotype as a random variab 
multinomial distribution; for details see lPark and Krug 



e wh ich is sampled according to a 



(120071). 



It is also useful to compare the results obtained for finite N with the predictions of the 
deterministic mutation-se l ection dynamics of "q uasispecies" type which applies for infinite 



populations (jBiirger 



2000 



Jain and Krug 



2007al ) . This is done by iterating the deterministic 



evolution equations for the frequencies f{cr,t) of genotype a at generation t, which read 

E.'M{a^a')W{a')f{a',t) 



f{a,t + l) 



Ea'Wia')f{a',t) 



(3) 
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Here M{a ^ a') = fi'^^°''°''\l — fj)^ d{u,a') -g ^j^g probability that an L-locus genotype a' mutates to 
genotype a at Hamming distance d{a,a'). 



Results 

EVOLUTION TIME SCALES 

We begin by a comparison of the time taken to reach the global maximum along the smooth and the 
rugged paths on the fitness landscape defined by equation ([T|). A population that is initially fixed for 
the minimum fitness genotype {000} has a choice between three single site mutations. Two of these 
(to genotypes {001} and {010}) lead the population to a smooth path towards the global fitness 
maximum {111}, whereas the third leads to the local fitness ma ximum 1 100} from which the 



population can escape on 



Weinreich and Chao 



20051; 



y by the creatioii of a double mutant (|lwasa et al 



Weissman et al. 



20091) 



2004 : 



We estimate the time scales of t 



mutation (SSWM) regime (iGillespie 



le re l evant evolutiona ry processes in the strong selection weak 



198.1 



1984 



Qrr 



20021 ). where Nfi <^ 1 and the population is 



monomorphic almost all the time. For simplicity, in the present discussion we neglect back 
mutations towards the wild type genotype {000}. Starting from the wild type, each of the single step 
mu tants is genera ted in the population at rate Nfx and goes to fixation with probability it{s) given 



by (jKimura 



19621 ^ 



7r(s) 



1 



1 



-2Ns 



(4) 



with s = si or 52- For ^ <^ s <^ 1, the fixation probability 7r(s) ~ 2s. The waiting times for low 
fitness (Ti) and high fitness (T2) mutants that will ultimately fix are therefore 

1 „ 1 



Ti 



T2 



(5) 



2nNsi 2nNs2 

Adaptation along one of the smooth paths proceeds by sequentially fixing two additional beneficial 
mutations with selection coefficient si, and the total evolution time is therefore Tsmooth ~ 3Ti. 

By contrast, populations that choose the rugged path need to escape from the local fitness peak 



{100} in or der to reach the glob a 



the lines of 



max imum. The corresponding escape time can be estimated along 



Weinreich and Chad (|2005l ). Following these authors we introduce the selection 
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coefficients 

W{lll) W{WO) 

which express the relative fitness advantage of the global maximum compared to the local peak 
(sben) and that of the valley genotypes compared to the local peak (sdei), respectively. Depending on 
the population size, the peak e scape can proceed through two distinct pathways. In populations 



smaller than a critical size Nc (Weinreich and Chao 



20051 ). the two mutations separating the 



genotypes {100} and {111} fix sequentially, while in larger populations they fix simultaneously. We 
are interested in population size ^ Nc ^ ln{s/fi)/s which can be easily satisfied in the SSWM 
regime as Ns ^ 1. In the simultaneous mode the escape time is given approximately by 

r ^ — (7) 

Assuming all selection coefficients si, S2, Sben, Sdei to be of a similar magnitude s, we see that 

^esc '5 



» 1 (8) 



Tl,2 

whenever /i <C s, which is expected to hold under most conditions. In particular, it is true in the 
SSWM regime because ^ 1 and Ns » 1. Equation ([8]) implies that the evolution time Trugged 
along a rugged path is dominated by the escape time Tesc, and is much larger than Tsmooth- However, 
both equation ([5]) and equation ([7]) share the same dependence on population size N, so once the 
type of evolutionary path is chosen, a large population is always at a relative advantage. 



MEAN FITNESS EVOLUTION AND HETEROGENEOUS ADAPTIVE 
TRAJECTORIES 

Figure [1] shows the evolution of the population fitness obtained from simulations of the 
Wright-Fisher model in the landscape defined by equation ([1]). Each curve contains data averaged 
over many stochastic histories for a given value of A^, keeping other parameters fixed, and starting 
with all individuals at the genotype {000} with lowest fitness. At short times the fitness rises more 
rapidly for larger populations, as expected on the basis of the estimates given in equation dS]) for 
Nfi <^ 1, while for Nfj, > 1 fitness becomes independent of N. Large populations are also seen to be 
at an advantage for extremely long times, beyond 10^ generations. However, for both parameter sets 
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displayed in the figure, the ordering of fitness with increasing population size is reversed in an 
intermediate time interval, which begins at around 2000 generations. 

The origin of this reversal is illustrated in Figure [21 which shows individual fitness trajectories 
for the parameter set of Figure [1] (b) and two different population sizes. Individual fitness 
trajectories display a step-like behavior, which reflect the transitions in the most populated genotype. 
In particular, populations in which the local peak genotype {100} becomes dominant are seen to 
remain trapped at the local peak for a long time (compare to equation dZ])). Although the initial rise 
in fitness is much faster for the large populations than for the small ones, the fraction of trajectories 
that take the rugged path (and thus get trapped at the local peak) also increases with increasing N, 
from 11/20 in the Figure [SJ^a) {N = 10^) to 19/20 in Figure [2l^b) (iV = 10^). As a consequence, the 



fitness after 10^ generations, when averaged over all trajectories, is larger for the smal 



than for the la ,rge ones. Similar t o the e xperimental observations of iRozen et al. 



populations 



(|2008l ) and the 



simulations of lHandel and RozenI (j2009l ). smaller populations reach a higher fitness level because 
their adaptive trajectories are more heterogeneous, allowing them to avoid trapping at the local 
fitness peak in a larger number of trials. 



PATH PROBABILITY 

To quantify the statement that large populations are more likely to take the rugged path, we 
introduce the probability Pr{N) that the rugged path is taken by a population of size N. In 
simulations, the probability Pr was measured by counting the number of events in which {100} 
becomes the most populated genotype for the first time. In Figure [3] these numerical results are 
compared with the analytical expressions (discussed below) and the two are seen to be in very good 
agreement. We see that Pr generally increases with (provided fi is not too small) thus supporting 
our main contention. Before presenting an analytic calculation of Pr covering the full range of 
population sizes, we discuss the limiting cases of very small and very large populations. 

SSWM regime: A/x < 1 

When Nfi <^ 1, the path probability Pr is equal to the probability that the first mutant that will be 
fixed in a population initially monomorphic for the genotype {000} is the local peak genotype {100}. 
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That is, 



vr(s2) 



(9) 



7r(s2) + 27r(si)' 

When selection is weak, in the sense of Nsi^2 ^ !> the fixation probability is given by its neutral 
value vr = and we obtain Pj. = 1/3. On the other hand, for strong selection (and assuming that 
•si,2 ^ 1) we have 



jSSWM 



S2 



(10) 

independent of N ^ which is equal to 0.54 and 0.56, respectively, in the two cases displayed in Figure 

m 



Deterministic quasispecies regime: N ^ oo 

For very large populations the local peak mutant is always present in the population in considerable 
numb ers and can therefore b e expected to dominate the population with a probability approaching 



unity (jjain and Krug 



2007bl ) ■ The quantitative analysis of this case is based on the deterministic 



infinite population dynamics defined by equation ([3|). As the initial population is assumed to be 
fixed at the genotype {000}, for small mutation rates equation ([3]) gives f{cr,t = 1) ~ ^d,{a,ooo) 
is the same for all genotypes at constant Hamming distance from jO OOj. For t > 1, t he gen otypic 



popu 



ation can be determined by a simple construction described by 



Jain and Krug (120051 ) and 



Jain 



()2007l ) in which the population of a genotype increases exponentially with its fitness, starting from 



/(CT,1). 

For our fitness landscape, as the population frequencies /(111, 1) < /(lOO, 1) but the fitness 
values VF(lll) > VF(IOO), it is possible that the genotype {111} becomes the most populated one 
before {100} thus bypassing the local maximum. The population at sequence {100} overtakes that of 
{000} at time ti when /(000,ti) = /(100,ti), which on using f{a,t) oc /('^'OOO)T^(o-)* gives 
ti = —lnfi/lnW{WO). Similarly the time t2 at which the population of the global maximum 
overtakes that of the initial sequence is given by t2 = — 3 In/x/ In VF(lll). Thus the condition for 
bypassing corresponding to t2 < ti reads 

W^{WO) < W{U1) ^S2<si 



which is ruled out by construction. Thus bypassing cannot occur and we conclude that 

lim Pr(N) = 1. 



(11) 



(12) 
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Clonal interference regime 



The phenomenon of interest in this paper occurs in the intermediate range of population sizes where 
Pr increases from its small population value in equation ([9]) to the large population limit in equation 
(|12p . This regime is more difficult to analyze because of the presence of multiple competing mutant 
clones in the population. To find an analytic expression for Pr{N) in this regime, we first reduce the 
three-locus problem into a single locus with three alleles, say A , B, and C with respective fitness 1, 
1 + si, and 1 + S2. The two genotypes {010} and {001} are lumped into a single allele B. The 
mutation from ^ to occurs with probability 2fi and that from yl to C with fi. No other mutation 
is possible, which ensures that either B ox C will be eventually fixed. It is clear that the fixation 
probability of allele B approximates 1 — P^. 

We now present an approximate calculation of Pr for the three-allele model using ideas from 
clonal interference theory. At time zero the population is monomorphic for allele A. We would like 
to determine the probability that an allele B which originates at some time to > fixes at a later 
time tQ + tf. It is plausible to assume that the fixation and origination of a mutation are not 
correlated, and hence to treat the two processes separately. 

Let us first consider the probability pi{t) for the allele B to originate at time t. An allele B 
appears in the population at rate 2N ^ and would, i n the absence of other mutations, go to fixation 



with probability 



Trfgi ) . As is customary in the field (jMavnard Smith 



Desai and Fisher 



1971 



Gerrish and Lenski 



19981 : 



20071 ). we interpret the fixation probability vr as the probability for the mutant 



population to survive genetic drift and, thus, to reach a size large enough for the further time 
evolution to be essenti ally determini stic. Mutations of type B which reach this level are called 



contending mutations (iGerrish 



I2OOII ). and they arise at rate 2NfnT{si). To obtain pi{t) this rate has 
to be multiplied with the probability that the contending mutation in question is the first to appear 
among all possible contenders for fixation. To estimate the probability that no contending mutation 
(of any type) has appeared before time t, we use a Poisson approximation in which the probability 
for the non-occurrence of an event is the negative exponential of the expected number of events. 
Since the expected number of contending mutations arising up to time t is Nij,{tt{s2) + 27r(si))t, we 
conclude that 

pi{t) = 2NfnT{si) exp(-A^/i(7r(s2) + 27r(si))t). (13) 



12 



We next determine the probability p2 that the fixation of the contending mutation of type B is 
not impeded by the appearance of a contending mutation of type C at some time larger than Iq. 
Such a mutation can only arise from the wildtype population A. According to our assumptions, the 
evolution of the frequency x of B alleles after time to follows the deterministic, logistic growth 
equation 

(14) 



dx 

- = s,x{l-x) 



The expected number of C alleles that arise from the wildtype population until the fixation time tf 
is therefore 

N^JL 



Nfi{l - x{t))dt 



dx 



In(xo), 



(15) 



ro Si 

where xq is the initial frequency of the contending B allele. As before, we use a Poisson 
approximation to determine the probability that no contending mutation of type C arises until the 
fixation of the B allele. Since the expected number of such contending mutations is 
— A^/iln(xo)vr(s2)/'Si, we have 

P2 = exp[iV/iln(xo)7r(s2)/si]. (16) 

To obtain the total probability \ — P^- for the B allele to fix we multiply p\ by p2 and integrate over 
the initial time which gives 

27r(si) 



/■oo 

l-Pr{N)=p2 / dto 

Jo 



^N^i ln(a;'o)7r(s2)/s 



7r(s2) +2^(si) 

To complete the analysis we have to determine xq. A naive argument would suggest that 
xq = 1/N because the contending mutation should start from a single mutant. However, this does 
not include the fact tha t the fixation process conditioned on survival is faster than the logistic 



(17) 



growth with xq = 1/A^ ([Hermisson and Pennings 



20051 : 



Desai and Fishe: 



2003). A simple 



approximate way to take into ac count this effect is to 



frequency xq = l/(siA^) » 1/A^ ([Mavnard Smith 
the final result given by 



1971 



et the contending mutant clone start at 

). Inserting this into equation (fT7|) we obtain 



Pr{N) = 1 



2'K[si] 



where 



7r(s2) + 27r(si 
Q{N) = N^i\n{Nsi)TT{s2)/si 



exp(-Q(iV)), 



(18) 



(19) 
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Figure [3] shows that equation (jlSp agrees well with simulations of the three-allele model described at 
the beginning of this section. In the strong selection limit, using tt{s) ~ 2s the above expression for 
Pr can be simplified to give PriN) pSSWMy-Q(N) ^-^^-^ Q{N) 2iV/i ln(iVsi)s2/si. Thus 

the path probability is close to the SSWM value when Q{N) <^ 1 and approaches unity for large 
Q{N). The increase of Pr beyond the SSWM value which ultimately gives rise to the reversal in the 
ordering of fitness with increasing population size in Figure [H takes place when Q{N) is of order 
unity, 

iV/iln(A^s) ~ 0(1) (20) 

where it is assumed that both selection coefficients have a similar scale s. 

It is straightforward to generalize the above derivation of Pr to an L-locus system where L — 1 
initial mutations confer a selective advantage si and one mutation confers a higher advantage 
S2 > si- This merely increases the mutation rate from allele A to allele B to {L — and leads to 
the expression 

^^(^) = 1 - ( f^n'^^Uf , ^M-NnHNs,)7r{s2)/s,). (21) 
tt{s2) + {L- l)7r(si) 

HOUSE-OF-CARDS MODELS 

At this point the question naturally arises as to how generic our results are with respect to the 
number of loci and the structure of the fitness landscape. To address this question, we simulated 
populations evolving in two ensembles of random fitness landscapes, the unconstrained and 
constrained house-of-cards models. In these models the fitness values of genotypes differing by single 
mutational steps a re assumed to be u ncorrelated. While this is not likely to be the case in real 
fitness landscapes (jMiller et al.ll201ll ). the house-of-cards models constitute the conceptually simplest 



realization of a generic, rugged fitness landscape which is essentially parameter-free, apart from the 
overall fitness scale S in equation 

Before discussing the dynamics of adaptation in random landscapes, we may ask how typical the 
three-locus landscape equation ([T]) itself is within the constrained ensemble with L = 3. The main 
topographic features of the landscape equation ([1]) are (i) the existence of a single local maximum, in 
addition to the global maximum, and (ii) the existence of 2 rugged and 4 smooth paths from the 
global minimum {000} to the global maximum {111}. The enumeration of all 6! = 720 possibilities 
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of ordering the fitness values of the 6 genotypes intermediate between the global minimum and the 
global maximum shows that these two features are shared by a fraction of 11/120 ~ 0.0917 of all 
landscapes in the constrained ensemble. However, the evolutionary dynamics of populations is 
determined not only by the order statistics of the fitness landscape, but depends also on the 
magnitude of the selection coefficients assigned to different adaptive pathways. These effects can 
only be addressed by explicit simulations, which we discuss next. 

In Figure m we show the time evolution of the mean fitness, averaged over many realizations of 
the landscape, for both ensembles of random three-locus landscapes; as before, all individuals are 
initially placed at the least fit genotype. In contrast to Figure [H the mean fitness is now a monotonic 
function of population size at all times: When averaged over the landscape ensemble, there is no 
advantage of small populations. This however does not preclude the existence of such an advantage 
for individual landscapes. To test for this possibility, we introduce the probability P{t, N, N') of a 
small population advantage, defined as the probability that the mean fitness of a population of size A'^ 
at generation t is larger than that of a population of size N' at the same generation, where N < N' . 

To determine P{t, N, N') numerically, we first calculated the mean fitness for population size A'^ 
by averaging over 128 independent evolutionary histories on a single landscape, and then compared 
it to that for a different population size N' on the same landscape. Finally the outcome of the 
comparison was averaged over 10^ samples of the random landscape ensemble. In order to avoid 
spurious contributions from cases where the fitness is in fact independent of population size and an 
apparent advantage of small populations arises due to fluctuations, we count only instances in which 
the inequality (1 — a)w{N) > w{N') is satisfied, where w{N) stands for the calculated mean fitness 
for population size N. We choose a = 10"'^, which is large enough to remove the error due to 
fluctuations. Of course, this may also screen out landscapes with very small advantage, but we do 
not think that this effect is substantial, since a mean fitness advantage of less than 0.1% is negligible 
compared to the scale 5 = 0.1 of typical selection coefficients. 

The resulting probabilities for the two ensembles are summarized in Figure O Referring first to 
the constrained ensemble, which mimics most closely the overall topography of the landscape 
equation ([T|), we see that a conspicuous probability for an advantage of small populations begins to 
appear around 10'^ generations and is most pronounced for population size N ~ 10^. Note that in 
Fig. [T]a where the selection coefficient is of the order of 0.1 as in Fig. [5l the fitness advantage of a 
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small population becomes conspicuous when the population with size 10 is compared to that with 
10^ in the time window between 10'^ and 10^. That is, the quantitative as well as the qualitative 
behavior of the house-of-cards model in the constrained ensemble is similar to the three-locus model 
in the previous section. Thus, we may conclude that the landscape equation ([1]) is quite generic 
within the constrained ensemble, in the sense that between 20% and 30% of all landscapes show 
similar dynamical features. In the unconstrained ensemble the probabilities are reduced by about a 
factor of 2, but the overall behavior is the same. 



MORE THAN THREE LOCI 

Within the house-of-cards models, it is straightforward to investigate the effect of varying the 
number of loci L. In the simulations reported in this subsection we allow for at most one mutation 
per individual and generation, and specify the genome-wide mutation proba bility U rather than t he 
mutation probability n per locus. The general relation between U and fj, is (jPark and Krugll2008l ) 
U = 1 - {1 - fj,)^ fiL when /iL < 1. With increasing L the difference between the constrained and 
unconstrained ensembles becomes less important, because populations typically do not reach the 
global maximum within the simulation time, and hence its precise location is irrelevant. In the 
following we therefore restrict the discussion to the standard (unconstrained) model. 

As an illustration, we present simulation results for L = 20 in Figure [6j In this case the 
expected fitness value of the global maximum is ~ 2.44 which is far larger than the mean fitness of 
the population with size 10^ within the observation time. This means that most of evolutions up to 
size 10^ and generation 10^ have not arrived at the global maximum, but rather explore the 
surroundings of the low fitness starting genotype. As in the three-locus case, the mean fitness 
averaged over all landscapes is monotonic in the population size (Figure [6^), and the plot of 
P{t, N, N') in Figure [6b displays a similar, though more pronounced advantage of small populations 
for iV = 10^ - 10^. However, for N > 10^ a new peak is seen to appear in P(t, N, N') at later time, 
which is not present for L = 3. 

We may interpret this behavior in terms of the different evolutionary regimes described by 



Jain and Krugl (l2007bl ). The disappearance of the first peak in P{t, N, N') marks the point where 
the mutation supply rate NU is sufficiently large for the population to easily escape from local 
fitness maxima by the creation of double mutants. Such a population will however still have 
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difficulties to cross wider fitness valleys, and hence it will tend to get trapped at local maxima which 
are separated by three or more mutational steps. The mechanism for an advantage of small 
populations that was found in the three-locus model thus reemerges on a larger scale, giving rise to 
secondary peaks in P{t, N, N'). 



Discussion 

Understanding the effect of population size on the rate of adaptation is a central problem in 



evolutionary theory , which continues to at t ract considerable attent ion (jWeinreich and Chao 



Gokhale et al 



of 



Rozen et al 



200S 



Weissman et al, 



20091 : 



Lvnch and Abegg 



2005 



20ld ). Motivated by the experiments 



20081 ) ■ the present work has addressed a specific aspect of this general problem. In 
the experiments, several populations of E. coli consisting of either 5 x 10^ or 2.5 x 10'' individuals 
were evolved in a complex nutrient medium which can be modeled by a complex fitness landscape. 
The fitness measurements after 500 generations showed that small populations can achieve higher 
fitness than large populations. 

A classic scenario in which a small population can acquire an evolutionary advantag e becau se of 



genet ic drift has been put forward in the framework of Wright's shifting balance theory (IWright 



1931 



) , referred to as SB T in the following discussion. Apart from the intrinsic shortcomings of the 



SBT ( Covne et al, 



19971 ) , however, there are several reasons why it cannot be directly applied to the 



experimental situation of 



Rozen et al 



(|2008l ). First of all, the population in the experiments is not 
structured, and it is therefore not possible for different demes to occupy separate fitness peaks as 
assumed in phase II of the SBT. Second, the number of generations in the experiment is too small 
for the entire population to cross a fitness valley either by the fixation of deleterious mutations 
(phase I of the SBT) or by the simult aneous fixation of individually deleterious but jointly beneficial 



mutations (jWeinreich and Chao 



20051 ). Hence for a proper explanation of the experimental 
observations it cannot be assumed that the population resides at a local fitness optimum from the 
beginning of the process. Rather, the evolutionary trajectories begin in a fitness valley, and the 



dynamics is determined 



population (jRozen et al 



)y th e competition between different fitness peaks that are available to the 



2003) 



In the preceding sections we have demonstrated that under these conditions, small populations 
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may indeed reach higher fitness levels than large ones because they are more likely to evade trapping 
at local fitness maxima. Our detailed study of a three-locus model where a single local fitness peak 
competes with the global maximum has shown that the dynamical behavior of the population fitness 
is not determined by the time scale to acquire beneficial mutation(s) alone and depends on the path 
probability Pr{N) also. This probability increases from a constant value 1/3 to P''^^^^^ and finally 
approaches the deterministic value unity as the population size increases. For the parameter 
regimes in which P.,. is constant in A^, as the waiting times Tj-ugged and Tsmooth both decrease with A^, 
larger populations are at an advantage. 

However when the probability to take a rugged path increases with N, a larger population may 
get trapped at the local fitness maximum thus acquiring lower fitness than a small population. For 
the parameters in Figure [H the path probability exceeds the SSWM value 0.55 for N > 500. From 
equation (jlSp . it is approximately 0.6 for N = 10^ but close to unity for N > 10'* and therefore a 
population of size N ~ 10'^ is able to acquire a higher fitness than larger populations. 

A key result of our analysis is that the regime of population sizes in which this mechanism 
operates coincides with t he onset of clonal interference, which occurs precisely when the criterion in 



equation ()20p is satisfied (jGerrish and Lenski 



19981 : 



de Visser et al 



199S 



Wilke 



2004 



Park and Krug 



20071 ). In the context of the three-allele model discussed previously, clonal interference implies that a 
high fitness clone (allele C) may arise while the low fitness mutant (allele B) is still on the way to 
fixation, thus enhancing the probability for C to fix and increasing the probability for the population 
to evolve along a rugged path. From the criterion in equation (j20p, we can dete r mine the mutation 
probability for the bacterial populations used in the experiment of 



Rozen et al 



(12003). Since the 



small population advantage is observed around A^ = 5 x 10^ and the characteristic size of selection 
coefficients derived from the fitness trajectories in the experiment is ~ 0.1, the estimated mutation 
probability is 

1 



which should be interpreted as a beneficial mutation rate. This estimate is consistent with values for 
the beneficial mutation rate in E.co l i obtained by other experimental approaches, which range from 



10 * to 10 ^ (jHegreness et al 



20061 : 



Perfeito et al 



2003). 



Our theoretical analysis also shows that the advantage of small populations over large ones is 
transient and at sufficiently large times, the fitness of the large population exceeds that of the small 
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population. This reversal occurs when the large population escapes the fitness valley at time 

Tesc ~ (A^/i^)~^ (see Eq. [7]) and should be testable over an experimentally accessible time scale of 10'^ 



generations if t 
experiments of 



le population size exceeds (10^^^) ^ ^ 10^ where we have used (|22|) . In the 



Rozenetal 



(|2008l ). the large populations had a size of 10^ which is two orders 
of magnitude below our prediction and hence the reversal in the ordering of fitness was not observed. 
It would be interes ting to test this eff ect in experiments using larger populations. 



In their work, 



Rozen et al. 



(|2008l ) attributed the fitness advantage seen for small populations to 



the heterogeneity in evolutionary trajectories. This qualitative description can be made precise 
consid ering predictabi lity of the first adaptive step along the evolutionary trajectory. Following 



Qrr 



(120051) and 



Rovl (|2009l ) we define the predictability P2 to be the probability that two replicate 
populations follow the same evolutionary trajectory, which equals the sum of the squares of the 
probability of these trajectories. Specializing to the first adaptive step in our three-locus system, the 
possible outcomes {100}, {010} and {001} occur with probability Pr, (1 — Pt)/'^ and (1 — Pr^jl^ 
respectively so that P2 = -Pr + ^(1 — -Pr)^ which increases from P2 = 1/3 to 1 with increasing 
population size, similar to the behavior of Pr itself. For the parameter range A^/.i ^ 1, N s ^ 1 in 
which Pj. is independent of A^, the predictability P2 < 1 is also a constant and as discussed above, we 
do not expect a fitness advantage for such populations. Thus although small populations will exhibit 
heterogeneous evolutionary trajectories in general, this does not necessarily imply a fitness 
advantage. 

The robustness of our results with respect to modifications of the fitness landscape equation ([T]) 
was addressed within the framework of two variants of the house-of-cards model. From this study we 
concluded that an advantage of small populations is generally not present when the mean fitness is 
averaged over all possible realizations of the landscape; however, in the appropriate range of time 
scales and population sizes identified for the landscape equation ([T]), a non- negligible fraction of all 
landscapes do disp l ay suc h an advantage. This agrees with the results obtained by 



Handel and RozenI (j2009l ) for a class of models with tunable ruggedness, and is also consistent with 



the analysis of t he me an fitness evolution in the infinite sites limit of the house-of-cards model by 



Park and Krud (|2008l ) 



Returning to the evolution experiments that initially motivated this work, we expect the effects 
described here to be observable whenever the underlying fitness landscape displays multiple local 
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maxima. Among the small number of aval 



found in some cases ( de Visser et al 



20091 : 



able empirical fitness landscapes, local n iaxima have been 



Lozovskv et al.l 



2003 ) but not in others (jWeinreich et al 



20061 ). Further work on the structure of empirical fitness landscapes is therefore needed in order to 



assess the generality of the proposed mechanism for an evolutionary advantage of small populations. 
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Figure 1: Average fitness as a function of time on the fitness landscape defined by equation ([T|) for 
II = 10~^, (a) si = 0.1, S2 = 0.25 and (b) si = 0.02, S2 = 0.05, and various population sizes indicated 
in the figures. As a guide to the eye, the maximum fitness values VF(lll) for each case are also drawn. 
The data have been averaged over 10^ histories. Fitness increases with population size at short times 
and at long times, but in both cases this relationship is reversed for a range of population sizes at 
intermediate times. 
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Figure 2: Population mean fitness as a function of time for 20 histories on the fitness landscape 
equation ([T]). The population size is (a) N = 10'^ and (b) 10'^, the selection coefficients are si = 0.02 
and S2 = 0.05 and the mutation probability is /i = 10~^ (same as in Figure [D^b)). The smooth curves 
depict the average over 10^ independent runs. 
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Figure 3: Fixation probability Pr of the allele C for the simplified three alleles system obtained using 
numerical simulations for /i = 10^^ (triangles), 10~^ (circles), and 10~^ (squares). Dotted lines show 
the analytic prediction of equations ()18ll9p . 
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Figure 4: Mean fitness evolution in random three-locus fitness landscapes. Fitness has been averaged 
over 10^ realizations of the landscape and 128 population histories in each realization, with parameters 
[I = 10"^ and S = 0.1, and population sizes as indicated in the figures. Both for (a) the constrained 
ensemble and (b) the unconstrained ensemble the mean fitness increases monotonically with population 
size for all times. 
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Figure 5: Probability of small population advantage in random three-locus fitness landscapes. Plots 
show P{t, N, N') as a function of t, with N' = WN and N as indicated in the figures, fi = 10"^, 
and S = 0.1. Part (a) shows the constrained ensemble, part (b) the unconstrained ensembles. The 
constrained ensemble with the maximum fitness at the antipodal point is more likely to allow small 
populations to have larger mean fitness. 
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Figure 6: (a) Mean fitness evolution and (b) probability of small population advantage for the uncon- 
strained ensemble with L = 20 loci. Parameters are U = 10^^ and S = 0.1, and population sizes as 
indicated in the figures. 
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